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Abstract 


Solving  Transportation  Problems  Via  Aggregation 


Richard  W.  Taylor 
School  of  Business  Administration 
University  of  Evansville 
Evansville,  IN  47702 


C.  M.  Shetty 

The  School  of  Industrial  and  Systems  Engineering 
The  Georgia  Institute  of  Technology 
Atlanta,  Georgia  30332 


_^This  paper  deals  with  the  solution  of  large  scale  transportation 
problems  by  aggregation  as  a  special  case  of  constraint  and  variable 
aggregation  in  linear  programming.  Using  cluster  analysis,  an 
appropriate  form  of  aggregation  will  be  used.  If  the  bounds  developed  at 


this  point  are  not  tight  enough,  a  suitable  unaggregation  step  will  be 


Key  Words:  Aggregation  methods,  transportation  problems,  large  scale 
problems 


1.  Introduction 


This  paper  deals  with  a  practical  strategy  for  solving  large 
transportation  problems.  Even  with  the  present  day  computers, 
transportation  problems  can  be  encountered  that  are  too  large  to 
solve  practicably.  Further,  in  many  instances  a  more  cost-effective 
standpoint  might  be  to  seek  only  approximate  solutions  to  these  problems. 
Currently  strategies  are  available  to  solve  large  problems  by  taking 
advantage  of  the  special  structure  of  transportation  problems.  The 
proposed  algorithm  utilizes  these  strategies  while  also  providing  a  means 
of  reducing  the  total  problem  size.  The  application  of  aggregate 
programming  to  the  transportation  problem  seems  a  natural  one  as  it  is 
intuitively  appealing  to  aggregate  sources  or  destinations  that  have 
other  similar  characteristics. 

The  main  thrust  of  the  proposed  strategy  is  the  aggregation  of 
constraints  and/or  variables  to  form  a  related  transportation  problem. 
This  problem  is  used  as  a  starting  point  for  a  sequence  of  problems,  each 
more  restrictive  than  the  preceding.  Bounds  on  the  optimal  solution  of 
the  original  problem  at  each  stage  are  also  computed.  At  each  stage,  we 
address  the  question  of  which  constraints/variables  we  would  like  to 
aggregate/ unaggregate  in  order  to  expedite  moving  towards  an  acceptable 
solution. 

The  general  approach  of  aggregate  programming  has  been  widely 
applied  in  many  areas  of  practical  research,  such  as  economics  [7], 
input/ output  analysis  [1,3, 6, 8],  and  production  planning  [4].  See  [14] 
for  an  extended  survey  and  references.  Unfortunately,  little  theoretical 
insight  is  available  in  these  areas . 
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2.  Related  Work 

The  classical  transportation  problem  can  be  stated  as 

M  N 

(TP)  Min  £  I  c.,x.,  (1) 

i-1  j-1  13  13 

N 

s  *t  •  I  x . .  —  a .  i-1,  2, . . . ,M  (2) 

J-1  J 

M 

£  x..  -  b  J-1.2,...,N  (3) 

i-1  3  3 

x^j  >  0  for  each  i,j 

M  N  * 

where  without  loss  of  generality,  we  assume  that  £  a.  -  £  b,.  Let  x 

i-1  1  j-1  3 

* 

be  the  optimal  solution  with  z  the  optimal  value  at  this  point. 

An  important  aspect  to  consider  when  solving  transportation  problems 
by  aggregate  programming  is  that  the  aggregate  programs  also  be 
transportation  problems  so  as  to  take  advantage  of  the  special  structure 
in  transportation  problems. 

Ue  will  first  consider  the  case  of  aggregating  destinations  (see 
Figure  1).  Later  we  will  consider  the  consequences  of  concurrent 
aggregation  of  sources  and  destinations. 

The  general  areas  of  study  in  aggregate  programming  as  applied  to 
the  transportation  problem  can  be  separated  into  three  sections : 

1)  Selection  of  the  destinations  to  aggregate, 

*  * 

2)  Derivation  of  bound  information  on  x  and  z  based  on  the 
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solution  of  the  aggregate  transportation  problem,  and  finally 
3)  Reformulation  of  the  aggregate  program  if  2)  does  not  provide  an 
adequate  solution. 

For  brevity,  we  hereon  refer  to  the  above  areas  as  aggregate  development, 

model  adequacy,  and  problem  reformulation. 

In  the  literature,  there  has  been  little  work  done  on  developing  the 

Initial  aggregate  program.  Most  of  the  research  assumes  some  prior 

knowledge  of  how  the  problem  is  to  be  partitioned.  They  assume  that  the 

destinations  to  be  combined  are  known.  Furthermore,  frequently  the 

actual  aggregations  are  made  with  equal  weightings.  Balas  [2]  proposes 

that  aggregation  may  be  applied  to  all  origins  (destinations)  that  are 

"neighbouring'*  based  on  the  values  of  the  c^’s.  The  definition  of 

neighbouring  is  intentionally  left  vague  as  his  emphasis  is  on  the  scale 

of  aggregation.  In  this  respect,  he  concludes  with  an  intuitively  argued 

best  initial  aggregate  program  size.  Although  his  suggestion  is  more 

appropriate  for  his  particular  algorithm,  his  thoughts  on  certain 

tradeoffs  is  applicable  to  the  level  of  aggregation  in  any  scenario. 

Zipkin  [20]  and  Geoff rion  [9]  both  suggest  that  the  area  of  cluster 

analysis  might  be  useful  in  developing  the  initial  aggregate  program.  In 

this  respect,  w  will  adopt  specific  clustering  techniques  that  are  both 

efficient  and  adequate  for  our  purposes. 

Now  suppose  the  aggregate  program  has  been  developed  and  solved  to 

yield  a  solution  point  xa  with  associated  value  za.  Zipkin  [20]  and 

Geoffrion  [9]  have  developed  upper  and  lower  bounds  for  the  original 

* 

problem  solution  value  z  using  the  aggregate  program  solution.  These 
bounds  will  be  slightly  extended  for  use  in  our  proposed  procedure. 


Further,  a  theoretical  groundwork  will  be  laid  for  further  tightening  of 
these  bounds.  This  extension  will  utilize  a  method  of  marginal  analysis 
similarly  used  by  Mendelssohn  [17]  and  Kallio  [12]. 

Finally,  a  recent  paper  by  Huberman  [10]  has  captured  some  of  the 
above  bounds  In  a  single  cohesive  format.  Also  adaptations  of  some  of 
the  earlier  bounds  have  been  made  to  the  multicommodity  network  and 
generalized  transportation  problem  by  Evans  [5]. 

In  many  instances,  the  above  described  bounds  are  not  adequate  and  a 
new  aggregate  problem  is  formulated  in  an  attempt  to  attain  tighter 
bounds.  Zipkin  [20]  suggests  a  possible  method  whereby  aggregate  terms 
are  separated  into  two  new  aggregate  terms  in  the  subsequent  problem. 

Our  proposal  will  present  a  variation  on  this  suggestion  that  has 
provided  lower  computation  times  in  preliminary  testing. 

Finally,  Balas  [2]  provides  a  method  for  aggregating  transportation 
models  and  then  successively  solving  these  modified  problems  as  a 
solution  procedure.  His  emphasis  is  placed  on  achieving  optimal 
solutions  and  as  our  procedure  will  suggest,  a  tradeoff  exists  between 
increased  accuracy  of  the  solution  and  computation  time.  No  computation 
times  are  given  by  Balas.  Lee  [15]  extended  Balas*  work  to  the  general 
minimum  cost  network  flow  problem.  These  methods,  as  well  as  the  general 
relaxation  solution  strategies,  are  used  in  part  in  this  study. 

3.  Development  of  the  Procedure 
For  the  moment  we  will  only  consider  the  case  of  aggregation  of 
destinations;  later  we  will  address  the  issue  of  concurrent  source  and 
destination  aggregation.  Recall  that  the  proposed  approach  consists  of 
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1)  aggregate  problem  development;  2)  model  adequacy;  and  3)  problem 
reformulation.  These  are  discussed  In  detail  below. 

3.1  Development  of  the  Initial  Aggregate  Program 

To  develop  an  Initial  aggregate  transportation  problem  we  must 
decide  first,  which  destinations  to  aggregate  and  secondly,  with  what 
welghtages  they  should  be  aggregated.  As  In  Figure  1,  let  us  first 
consider  a  general  aggregation  of  original  destinations  jEPr  into  a 
single  aggregate  destination  r.  This  corresponds  to  aggregating  each  of 
the  demands  bj  for  j  eP^  into  a  demand  b^.  Additionally,  the  flows  from 
each  of  the  sources  into  these  destinations  can  be  aggregated  for  all 
jePf.  In  particular,  aggregation  of  the  demands  b^  corresponds  to  the 
aggregation  of  the  constraints  in  Eqn.  (3)  for  jeP^  with  multipliers 
tj«l, 2, . . . ,N.  Aggregation  of  the  flows  from  each  of  the  sources  Into  the 
destinations  j«Pr  corresponds  to  the  aggregation  of  M'sets  of  variables 
je*r  with  multipliers  s^j.  These  constraint  and  variable 
aggregations  can  be  performed  for  all  sets  Pr,  r-1,2, ...,L  to  give  a 
general  aggregated  transportation  problem  (TP-CV)  as  follows: 

M  L 

(TP-CV)  Min  I  Z  Z  s.,c..x.  (4) 

i-1  r-1  jePr  ir 

L 

s.t.  Z  Z  8^4xir  ”  i-l,2,...,M  (5) 

r-1  j  eP  •  lr  1 


DESTINATION 


SOURCE 


Variable  Aggregation 


Demand 


Figure  1 

Aggregation  of  Destinations 


M 


E  C1  2  8ijxir  “  E  t1bj  r“1,2,***,1‘ 

jePr  J  i-1  jePr  J  J 


s^j  x^r  >  0  for  each  i,j,r 


(6) 


(7) 


As  stated  earlier,  it  is  desirable  that  the  aggregate  program  be  a 
transportation  problem.  Let  us  view  what  conditions  are  required  to  put 
the  program  (TP-CV)  in  the  form  of  an  M-source,  L-destination 
transportation  problem  as  shown  below: 


(ATP) 


M  L 

za  *  Min  E  E  c.  x. 

,  ,  .  ir  ir 

i-1  r-1 


s.t.  E  xir  -  at 
r-1 


i-l,2,...,M 


M 

E  x«r  “  br  . . L 

i-1 


x^r  >  0  for  each  i,r 


(8) 


(9) 


(10) 


(ID 


Comparing  Eqn.  5  with  Eqn.  9  and  Eqn.  7  with  Eqn.  11,  we  must  have 


E  s.,  -  1  for  each  i  ;  s. .  >  0  for  each  i,j 
jepr  iJ  U 


(12) 


Also  from  Eqn.  6  and  Eqn.  10,  and  setting  the  aggregate  demand  bf  equal 
to  the  sum  of  the  demands ,  we  have 
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Z  t.b.  ■  Z  b,  ■  b  for  each  r  (13) 

IjP  igp  J  r 

J  r  .  r 

Again  f rcn  Eqn.  6  and  Eqn.  10  we  will  need 

Z  t.s.,  ■  1  for  each  i,r  (14) 

jeP  J  J 
J  r 

If  Che  conditions  of  Eqns •  12,  13,  and  14  are  met  then  the  program 
(TP-CV)  will  be  a  transportation  problem.  It  Is  easy  to  see  that  there 
are  an  Infinite  number  of  multipliers  tj  and  s^  that  will  satisfy  these 
requirements.  For  example,  let  tj*l  for  each  j  and  then  any  convex 
combination  of  the  s^'s  j«i?r  will  work.  Later  in  this  section  we  will 
specifically  address  the  issue  of  the  multipliers  to  be  used  in  our 
algorithm,  tfe  first  address  which  destinations  should  be  clustered 
together . 

It  should  be  apparent  that  if  we  are  to  preserve  the  specialized 
structure,  then  aggregation  of  the  demand  constraints  will  also  define 
an  aggregation  of  the  flows.  Thus  if  we  aggregate  demand  constraints  for 
j£Pr,  we  must  similarly  aggregate  the  variables  x^j,  j£Pr  f°r  each  1. 
Conversely,  the  partitioning  of  the  flows  will  also  determine  the 
partitions  for  the  demand  constraints.  In  this  light,  we  can  discuss 
either  the  aggregation  of  the  primal  demand  constraints  or  the  dual  flow 
constraints  (corresponding  to  the  primal  variables). 

To  determine  the  constraints  to  aggregate,  a  previus  study  [18] 
suggested  the  angle  between  the  normals  as  a  primary  measure  of 
"closeness"  of  constraints,  and  the  closeness  of  the  right-hand-side 
values  as  a  secondary  measure.  In  the  case  of  the  transportation 


problem.  It  Is  not  difficult  to  show  that  in  both  the  primal  and  the  dual 


problems,  the  angles  between  all  pairs  of  constraints  are  close.  Hence, 
for  a  given  pair  of  destinations  q  and  k,  we  want  to  find  the  closeness 
of  the  vectors  c^  and  c^  which  are  the  right-hand-sides  of  the  dual 
constraints .  However  comparing  this  for  all  pairs  can  be  computationally 
Inefficient.  Hence,  we  will  compare  each  vector  with  an  "average  vector 
w  defined  by 


W  <ci+c2+  “*  +CN> 

by  computing  the  angle  between  c^  and  w  given  by: 

„  -1  C,1>W 
hj  "  COS  lie j  U  Uw  9 

Then  if  L  partitions  (clusters)  are  desired,  it  seems  desirable  to 
define  angles  0  <  e^  <e2  <,...,  <  eL-i  *  180,  and  let 

Cluster  ■  (j:  0  <  h^  <  e^) 

•  Cluster  ?2  *  (j:  e^  <  hj  <  e£) 

Cluster  PL  «  (j:  e^^  <  h^  <  180) 

Of  course  in  the  actual  partitioning  procedure,  one  does  not  need  to 
perform  either  the  cos  \  •)  operation  or  the  common  division  by  Jw  I,  as 
long  as  the  appropriate  modifications  are  made  to  the  e^'s.  This  method 
also  has  the  advantage  that  it  is  computationally  efficient. 


10 


A  final  Coplc  In  the  partitioning  of  the  destinations  is  the 
determination  of  the  number  of  and  the  values  of  the  e^'s.  The  e^'s  are 
chosen  to  uniformly  partition  the  interval  [0,180].  As  will  be  more 
evident  in  the  reformulation  process,  it  makes  sense  to  require  that  a 
minimum  number  of  destinations  be  contained  in  each  cluster.  For  our 
purposes,  the  minimum  number  was  chosen  as  three  destinations  (sources) 
per  cluster  which  allows  for  at  least  two  iterations  of  the 
reformulation  process. 

We  developed  above  a  procedure  to  partition  the  destinations  into  L 
clusters,  r*l, 2, . . . ,L,  based  on  variable  aggregation  of  the  flows. 

In  order  to  retain  the  specialized  structure  of  the  transportation 
problem,  the  above  variable  aggregation  forces  a  specific  constraint 
aggregation  of  the  demand  constraints  as  discussed  earlier.  Thus  the 

clusters  jePf  *“1*2 . .  will  also  be  used  to  give  L  aggregate 

constraints  of  the  form: 


9 


M 

I  x.  “  E  b,  r*l, 2, . . . ,L 

i“l 


The  previous  section  gives  the  procedure  for  the  partitioning  of  the 
destinations.  The  next  step  in  our  algorithm  is  to  develop  the  ® 

multipliers  for  combining  the  associated  constraints  and  variables. 

Comparing  Eqn.  4  with  Eqn.  8,  to  make  the  total  costs  equal,  we  must 

have  9 


Clr  *  ‘  slj  clj 
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Further  If  t.  ■  1.  then  x.  ■  E  x..  .  To  make  the  costs  associated 
3  ir  k£p  ik 

r 

with  x^^  equal  to  the  sum  of  the  costs  associated  with  the  aggregated 
flows,  we  must  have  c^x^  "  Z  CijXij  90  tliat 


S  c  x 
Jepr  U 


Z  x 


keP 


ik 


Comparing  the  expressions  for  c^^  we  have 


’ij 


lil 


keP 


wik 


Since  the  flows  x^  are  not  available,  it  seems  reasonable  to  assume  that 
the  flows  to  the  destinations  aggregated  are  likely  to  be  proportional  to 
the  demand  so  that  a  good  approximation  for  s^  will  be 


A 

s 


ij 


for  each  j  d*r  and  for  each  i,r 


(15) 


/N 


Note  that  Cj"l  and  £ ..  ,  defined  above  satisfy  the  conditions 
necessary  for  insuring  that  the  aggregate  program  is  a  transportation 
problem.  As  we  shall  see  in  the  section  on  bounds,  use  of  these 

* 

multipliers  will  also  provide  us  with  an  immediate  upper  bound  on  z  . 
Incidentally,  these  multipliers  are  precisely  the  pro-rata  demand 
multipliers  used  by  Zipkin  [20]  and  Geoff rion  [9]. 


3.2  Model  Adequacy;  Bounds 


The  previous  section  develops  a  procedure  to  fora  an  initial 
aggregate  transportation  problem.  In  this  section,  we  develop  bounds  on 
the  original  problem  solution  from  the  solution  to  the  aggregate 
problem. 

Consider  the  initial  aggregate  problem  (ATP).  The  problem  can  be 

solved  to  obtain  a  solution  x  with  solution  value  z  and  optimal  dual 

multipliers  (u  ,v  )•  Zipkin  shows  that  the  solution  value  z  forms  an 
* 

upper  bound  on  z  . 

*  * 

Theorem  1  [Zipkin]:  Let  x  solve  (TP)  with  value  z  .  Let  (ATP)  be  the 
aggregate  program  of  (TP)  formed  by  the  aggregate  multipliers 
s^  in  Eqn.  15  and  tj“l  for  all  j*  Let  xa  solve  (ATP)  with 
value  za.  Then 


<  z 


In  order  to  obtain  feasible  points  to  (TP)  it  is  necessary  to 
disaggregate  the  solution  xa  (size  MLxl)  to  a  vector  of  size  MNxl.  Two 
methods  of  disaggregation  are  discussed  in  [19].  The  first  and  simplest 
to  use  is  fixed  weight  disaggregation.  We  can  obtain  a  feasible  solution 
x*  to  (TP)  by  disaggregating  xa  ■  [xaf]  as  follows: 


/v  a 

8ijxir 


for  each  JePr,  for  each  r  and  for  each  i 


where  s^  are  the  pro-rata  demand  multipliers  from  Eqn.  15.  Hence  an 

* 

upper  bound  on  z  will  be  as  follows : 
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*  .  f 
z  <  z 


M  N  f 

i-1  j-1 


An  Important  Issue  to  note  Is  that  a  feasible  solution  x  can  be  found  by 
this  method  regardless  of  the  multipliers  s^  used  to  form  the  initial 
aggregate  program.  A  consequence  of  using  s^  to  form  the  initial 
aggregate  program  is  that  za  -  zf. 

A  second  alternative  is  to  use  the  method  of  optimal  disaggregation 
as  discussed  by  Balas  [2]  and  Zipkln  [19].  This  method  entails  solving  a 
transportation  problem  for  each  cluster  Pf  r-1,2,  ...,L  in  order  to 
produce  optimal  flows  within  each  cluster.  Thus  solving  the  following 
programs  for  each  r-1,2, . . . ,L: 


z  *  Min  Sic. .x. . 
i-1  jePr 


I  x„  -  x±  i-l,2,...,M 
jEPr 


I  x, ,  -  b.  for  each  JeP 
i-1  3  r 


x^j  >  0  for  each  i,  for  each  j ePr 


we  will  get  a  feasible  solution  to  (TP)  with  solution  value  z*  -  I  zr. 


Zipkin  also  shows  that  the  upper  bound  formed  by  the  optimal 
disaggregation  procedure  is  at  least  as  good  as  the  fixed  weight 
disaggregated  solution  value.  The  optimal  disaggregation  procedure  does 
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however  require  considerably  more  computation  time,  as  a  transportation 
problem  must  be  solved  for  each  aggregate  cluster.  These  two 
alternatives  were  computationally  tested  to  determine  the  conditions 
which  favours  each  method.  Preliminary  results  indicate  that  the  optimal 
disaggregation  method  is  most  effective  at  the  later  stages  when  the 
bound  obtained  is  close  to  the  desired  accuracy.  In  these  cases,  the 
expense  of  the  additional  computation  time  of  solving  these  sub¬ 
transportation  problems  is  offset  by  achieving  sufficient  additional 
accuracy. 

it 

We  now  proceed  to  develop  a  lower  bound  on  z  .  Let  (u,v)  be  a  set 
of  dual  variables  with  components 

u^  ■  u®  and  v^  ■  v®  for  jePr  and  r  •  1,2 . L. 

A  A  A 

when  u^  and  vf  are  the  optimal  dual  multipliers  to  (ATP).  Let  (d,d)  be 
an  arbitrary  direction.  Then 

*  N  M  N  M  A  „  * 

z-  E(v+6d)b  +  E(u+0d.)a  -  E  E  [(v  +M  )  +  (u  +M  )  -  cf]x.  . 

j-1  J  J  J  i-1  1  1  1  j-1  i-1  J  J  1J 


The  first  two  terms  are  the  dual  objective  function  value  at  (u,v)  + 


6(d,d)  and  the  last  term  is  a  penalty  for  violating  the  dual  constraints. 


N 


M 


M 


Noting  b  ■  E  b.  ,  z  “  E  v.b,  +  E  u.a.,  and  E 
r  keP  K  j-1  J  J  i-1  1  i-1 


ij 


b  j ,  we  have 


*  N  M  N  M 

z  -  za  +  0(  E  b  d  +  E  ad)  -  E  E  [(v  +  fld.) 

^  J  i*l  1  x  i1"!  J  J 


+  (Ut  +  ad^  -  CylXy 


N 


M 


>  2“  +  0(  E  b  d  +  Z  ad) 
j-1  2  2  i-1 


+  <UI  +  Mi>  -  cijlbj 


N 

E  max[(v.  +  9d.) 
j-1  i  2  2 


Zipkin  [20]  derives  the  same  bounds  using  a  relaxation  procedure  when 

0-0. 


However  for  each  destination  j,  we  are  allocating  the  full  demand 


M  * 

E  x. ,  -  b.  to  the  particular  term  corresponding  to  max  [(v  +  0i  )  + 

1-1  3  2  i  2  2 

(u®  +  0d^)  -  c^j].  If  however  we  reallocate  the  total  amount  b^  to 

specific  Indices  1  so  that  an  allocation  for  any  cell  (l,j)  will  be  no 

* 

greater  than  min  (a^b^)  then  a  better  bound  on  z  is  available.  Thus  if 

A  A 

[(Vj  +  0dj)  +  (u“  +  Od^)  -  Cjj]  are  in  descending  order,  we  have 


N 


M 


M  N 


z  >  2  +  0(  E  b.d  +  E  a  d.)  -EE  [(v  +  0d  ) 
j-1  22  1-1  1-1  j-1  2  2 

i-1 

+  (u*  +  0dt)  -  c^J  max  {0, tiling, b^  -  E  a8l  } 


Hereinafter  the  right  hand  side  of  the  above  expression  will  be 
denoted  as  z(  0)  which  is  stronger  than  the  bound  derived  earlier.  Both 
methods  of  bounding  were  Incorporated  into  the  full  algorithm  and  it  was 
found  that  both  methods  required  less  than  IX  of  the  total  time  to  solve 


Che  problem.  However  in  a  preliminary  comparieon  of  Che  accuracy  of  Che 
resulcing  bounds,  ic  was  found  Chac  ouC  of  40  observacions ,  Che  proposed 
lower  bound  was  on  Che  average  16. 8X  (sCandard  deviation  4.3Z)  better 
Chan  Zipkin's  bounds. 

The  above  discussion  clarifies  Che  lower  bound  Co  be  used  in  our 
proposed  algorichm  is  sCaCed  precisely  in  Theorem  2  below. 


*  * 

Theorem  2  Lee  x  solve  (TP)  wlch  value  z  .  Let  (ATP)  be  Che  aggregate 


program  of  (TP).  Let  xa  solve  (ATP)  with  value  za  and  optimal 
dual  multipliers  (ua,va).  Let  v  ■  va  for  each  jEPf  for  each 
r.  Let  the  terms  [(Vj  +  fidj)  +  (ua  +  Bd^)  -  c^]  be  reordered 
over  the  index  i  Co  form  a  nonincreasing  sequence.  Lee 


M 


M  N 


z(  0)  *  z  +  9(  E  b  d  +  Z  ad)  -  Z  Z  [(v  +  0d  ) 
j-1  J  J  i-1  i-1  j-1  3  J 


i-1 


(ui  +  W1)  -  ctj]  max{0,min[a^,bj  -  Z  ag]} 


8“1 


*  *  * 

Let  6  -  argraax  z( 8) .  Then  z(  0  )  <  z 


Theorem  2  above  gives  a  general  lower  bound.  However,  in  order  to 
use  the  bound  it  will  be  necessary  co  specify  a  direction  of  search  and  a 
line  search  method.  First  let  us  look  at  the  function  z(9).  As 
discussed  in  [17]  and  [18],  the  function  z(  0)  will  be  piecewise  linear 
and  convex  with  breakpoints  0.  .  as  defined  below 


for  each  i,j 


Ak 


Note  that  even  for  a  relatively  email  transportation  problem,  there  are  a 

large  number  (MxN)  of  breakpoints.  Clearly  it  is  important  that  as  few 

* 

of  these  -  breakpoints  be  used  as  possible  in  finding  6  .  Certainly  the 
bounds  produced  by  the  line  search  for  bounding  will  be  at  least  as  good 
as  when  (HO.  However  additional  computation  time  will  be  required  for 
the  line  search.  In  order  to  provide  preliminary  testing  of  the 
algorithm,  the  computational  runs  will  only  use  the  value  (HO.  This 
assignment  also  eliminates  determination  of  the  specific  directions  d  and 
d  at  this  stage. 

Recall  that  the  bound  in  Theorem  2  was  formed  by  allocating  the 
*  * 

demand  £  x. .  -  b.  to  particular  cells  i,  1-1,..., M.  A  similar  bound 
i-1  3  3 

«  * 

could  have  been  formed  by  allocating  the  supply  £  x  -  a  to 

j-1  3 

particular  cells  j  j-1 . N.  This  procedure  would  yield  a  lower  bound 

z'(  8)  of 


z'(8)  -  + 


N 

0(  £ 


M 

£ 


M 

£ 


N 

£ 


b  d  +  £  ad)  -  £  £  [<v  + 

j-1  3  3  i-1  1  i-1  j-1  3 


V 


a  3-1 

+  (u“  +  8d.)  -  c.,1  max  {0,min  [b,,  a.  -  £  b  1} 


§L. 
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la  Che  Halted  problems  tested,  both  bounds  were  relatively  close  to  each 

other.  As  discussed  earlier,  the  time  to  compute  these  bounds  is  a  small  • 

fraction  of  the  entire  solution  time  hence  it  seems  reasonable  to  compute 

both  of  these  bounds  at  each  iteration.  Then  the  maximum  of  these  two 

bounds  can  be  used  as  the  best  lower  bound.  Incidentally,  Zipkin  [20]  „• 

also  shows  these  parallel  sets  of  bounds  for  his  particular  bound 

formulation.  His  testing  also  does  not  provide  any  insight  into  the 

conditions  of  when  one  approach  provides  a  better  bound  than  the  £ 

alternative.  We  should  note  that  these  same  bounds  are  also  derived  by 
Geoffrlon  [9]  and  Huberman  [10],  although  by  a  different  approach. 

By  the  combination  of  Theorems  1  and  2  we  have  the  following  bounds  ^ 

on  the  optimal  solution  of  the  original  problem 


<  z  <  z 


where 

zP  ■  max{z(0),  z'(0)} 


9 


If  the  bounds  above  are  adequate  then  we  can  stop.  If  not,  then  it 
is  desirable  to  form  a  new  aggregate  problem  chat  will  provide  tighter 
bounds  than  those  described  above.  For  this  purpose  we  define  in  general 
the  lower  bound  as  zp  and  the  upper  bound  as  z^,  where  at  this  stage  of 

p  a 

the  algorithm  zf  ■  z  and  z^  ■  z  .  It  is  convenient  to  measure  the  error 
at  this  stage  by  an  error  bound  level  defined  by 


9.  -J 


Reformulation  of  the  Aggregate  Problem 


In  many  instances  the  bounds  provided  by  the  aggregate  program  may 
be  satisfactory.  If  not,  a  method  of  reformulation  is  adopted  to  tighten 
the  available  bounds.  This  process  will  be  accomplished  by  unaggregating 
one  of  the  destinations  from  each  aggregate  cluster. 

Several  reformulation  methods  are  discussed  in  the  literature. 

Zipkin  [20]  presents  a  rather  broad  algorithm  that  contains  numerous 
possibilities  for  the  regrouping  of  aggregate  nodes.  His  main  suggestion 
is  to  divide  each  constraint  cluster  into  constraints  that  are  violated 
at  the  current  solution  and  those  constraints  that  are  not  violated  at 
the  current  solution.  This  alternative  is  eliminated  since  in 
preliminary  testing,  the  size  of  the  aggregate  programs  and  hence  the 
computational  effort  in  solving  the  subsequent  aggregate  programs  grows 
more  rapidly  than  the  accuracy  of  the  bounds.  Zipkin  offers  additional 
possibilities  where  some  but  not  all  of  the  clusters  might  be 
reformulated  or  that  some  other  criteria  other  than  the  violation  of  the 
constraints  might  be  used. 

Balas  [2]  uses  another  reformulation  algorithm,  namely  once  a 
cluster  has  been  identified  that  contains  a  violated  constraint,  then 
all  of  the  destinations  in  that  cluster  are  unaggregated  in  the 
subsequent  program.  This  method  has  a  similar  drawback  to  Zipkin' s 
method  in  that  the  size  of  the  aggregate  problem  quickly  aproaches  the 
order  of  the  original  problem.  However,  Balas*  method  is  focused 
towards  solving  the  original  problem  to  optimality  without  consideration 
of  any  resource  constraints  that  may  limit  the  size  problem  solvable.  An 
additional  point  is  that  Balas'  method  uses  different  aggregate  methods 
to  obtain  his  initial  aggregate  program,  hence  his  reformulation  scheme 


is  partially  based  on  this  particular  aggregation.  We  eliminate  the 
strategy  of  constructing  an  entirely  new  initial  aggregate  program  as 
requiring  considerable  effort  without  guaranteeing  that  the  bounds  at 
subsequent  stages  will  prove  to  be  any  better  than  the  ones  already  at 
hand. 

The  proposed  method  is  to  unaggregate  only  the  "most  violated 
destination"  in  each  cluster,  which  seems  to  yield  a  favorable  tradeoff 

between  increase  in  the  aggregate  problem  size  and  decrease  in  the  error 

bound  level.  To  do  this,  we  unaggregate  the  destination  j  within  each 
cluster  that  has  the  most  violated  constraint,  i.e.,  the  unaggregated 
destination  j  in  cluster  Pf  is  defined  as 

Jr  «  &£*r:  [u*  +  vk  -  clkl  >  tuj  +  Vj  -  cij]  for  each  j£Pr  and 

[u*  +  Vj  -  ClJ]  >  0)  (16) 

This  method  also  has  the  advantage  that  the  values  [Uj+Vj-c^]  are 

readily  available  as  they  represent  the  levels  of  violation  in  the  error 

bound  z(8).  Further,  the  aggregate  multipliers  s^  can  be  easily  updated 
in  the  subsequent  program,  where  the  multipliers  in  aggregate  destination 

-*>  A 

r  will  be  — ’)  for  jePr,  ^r’  T*ms  8iven  an  aggregate  program, 

r  ^r 

we  can  construct  a  scheme  that  produces  a  reformulated  aggregate  program 
as  follows: 

1.  For  each  cluster  Pf,  unaggregate  destination  J  as  defined  in 

A  ^ 

Eqn.  16  to  form  two  new  clusters  (Jr)  and  (JePr,  j*Jr).  Do  for 
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dll  t"l |2t * • • f L* 

2.  Reaggregate  using  the  clusters  in  step  1. 


The  above  procedure  describes  the  method  of  reformulating  the 
Initial  aggregate  program  Into  a  subsequent  aggregate  programs.  The 
subsequent  programs  are  based  on  the  preceding  programs,  hence  the 
updating  from  one  program  to  the  next  Is  minimal.  Further,  at  Iteration 

no 

k,  we  can  construct  bounds  for  the  program  ATP'  using  Theorem  1  to 

00  (k) 

yield  an  upper  bound  z  and  Theorem  2  to  yield  a  lower  bound  z'  . 

d  r 

Since  the  program  ATP^+1^  is  a  restriction  of  the  preceding  program 
ATP^k),  we  have: 


Theorem  3  Let  x^k^  solve  ATP^  with  value  z^  .  Let  ATP^1^1^  be  the 
a  a 

reformulated  program  of  ATP^k\  Let  x^k+1^  solve  ATP^+^ 

with  value  z(k+1).  Then  z(k)  >  z(k+1) 
a  a  a 


Since  every  iteration  yields  a  better  upper  bound  than  the  previous, 
then  at  iteration  k  we  can  define  the  current  best  upper  bound  z^,  as 


(kl 

In  a  similar  manner  for  every  aggregate  progran  ATP  '  we  can  derive 

00 

a  lower  bound  z£  .  Unfortunately,  the  lower  bounds  are  not  necessarily 
monotonlcally  increasing.  However,  if  we  define  the  lower  bound  zf  as 
the  maximum  of  all  the  lower  bounds,  i.e.  as 


max 


«<2)  - 


.00  i 
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then  the  error  bound  level  of 


will  be  nonincreasing  at  every  iteration.  In  practice,  it  is  generally 
strictly  decreasing,  as  the  conditions  yielding  otherwise  are  rare. 

In  summary,  given  an  initial  aggregate  program,  it  can  be 
reformulated  by  unaggregating  particular  violated  constraints  in  selected 
dusters  to  form  a  subsequent  aggregate  program  that  is  still  a 
transportation  problem.  The  bounds  formed  from  the  subsequent  programs 
are  successively  tighter  and  thus  the  algorithm  can  be  continued  until 
adequate  bounds  are  attained. 

Note  that  the  above  method  of  reformulating  the  aggregate  problems 

(k) 

will  result  in  a  sequence  of  aggregate  solutions  z^  ,  that  will  converge 

to  the  optimal  solution  value  of  (TP)  In  a  finite  number  of  iterations. 

(k) 

This  is  so,  since  at  each  stage  the  number  of  destinations  in  ATP 
Increases  by  at  least  one,  hence  in  at  most  N  -  L  steps  (where  L  was  the 
number  of  aggregate  destinations  in  ATP^  ^),  the  initial  problem  will  be 
at  hand. 

Extension  to  Source  Aggregation 

For  the  most  part,  the  above  methods  have  been  illustrated  for  the 
case  of  only  destination  aggregation.  However  the  principles  are 
identical  for  source  aggregation.  Joint  source  and  destination 
aggregation  is  simply  the  combination  of  the  separate  source  aggregation 
and  destination  aggregation.  Again,  the  principles  can  be  easily 
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extended*  As  we  have  seen,  destination  aggregation  produces  an  aggregate 
problem  that  is  a  restriction  of  (TP).  Source  aggregation  simply  further 
restricts  the  problem.  Thus  the  solution  to  a  joint  aggregated  problem 

•k 

still  yields  an  upper  bound  on  z  .  Also,  the  proposed  lower  bounds  can 
be  easily  generalized  to  this  case. 

In  a  recent  paper,  Zipkin  and  Raimer  [211  present  an  alternative 
method  of  disaggregation  which  preserves  integrality  of  the  solution.  No 
comparison  is  provided,  however,  with  the  bound  obtained  by  the  optimal 
disaggregation  method. 


4.  Computational  Results  and  Discussion 
One  of  the  main  objectives  of  the  research  was  to  provide  an 
algorithm  that  would  Initially  form  a  good  aggregate  program,  utilize  the 
information  contained  therein  to  generate  bounds  on  the  original  problem 
solution  and,  if  need  be,  to  reformulate  the  aggregate  problem  into  a  yet 
better  aggregate  problem  with  tighter  bounds.  Within  these  objectives 
lies  the  necessity  to  generate  an  algorithm  that  is  efficient,  or  at 
least  comparable  in  computation  time  to  solving  the  original  problem. 
Second,  it  is  desirable  that  the  algorithm  should  contain  the 
capabilities  to  solve  problems  larger  than  the  available  transportation 
code  can  handle.  Finally,  the  bounds  developed  on  the  optimal  values 


should  be  reasonably  tight. 


Implementation  of  the  Algorithm 

The  proposed  algorithm  was  implemented  in  its  entirety  on  the  CDC 
CYBER  170/730  mainframe  operating  under  the  NOS  1.4  operating  system.  All 
control  programs  to  run  the  various  programs  were  written  in  a  procedural 
machine  language  basic  to  the  CYBER  mainframe.  The  transportation  code 


TPGO  used  to  solve  both  the  original  and  aggregate  problems  Is  by 
McGinnis  [16].  The  code  utilizes  a  tree  representation  for  the  basis  and 
Is  reasonably  efficient.  The  program  Is  written  In  Fortran  IV. 


All  algorithms  to  develop  the  Initial  aggregate  programs  and 
reformulate  the  subsequent  aggregate  programs  were  written  In  APL  2.1014. 
This  allows  for  efficient  handling  of  the  problem's  data  structures. 


Generation  of  Test  Problems 


The  transportation  problems  were  randomly  generated  from  a  uniform 

[0,1]  distribution  of  values.  Randomly  generated  demands  and  supplies 

were  scaled  to  the  range  [5,50]  with  one  dummy  demand  set  up  to  balance 

supply  and  demand.  Geoffrlon  [9]  shows  that  a  sufficient  condition  for 

* 

an  error  bound  level  of  zero  (l.e.  z  -  z  )  occurs  when  the  cost  c, ,  can 

u  Ij 

be  written  In  a  factored  form,  l.e.  c^  *  qibj  for  each  If  the 

destinations  are  close  and  similar  in  the  type  of  deaand,  then  the  q^'s 
may  represent  a  cost/ unit-mile.  With  this  thought  In  mind,  the  costs 
were  generated  by  c^j  ■  q^b^  +  rij  w*,ere  q^«-[40,50]  and  the  parameter 
r^j e[-100,100]  to  produce  some  randomness. 


Experimental  Design 

The  problems  tested  were  divided  into  two  categories:  medium  size 
problems  (30  sources,  30  destinations);  and  large  problems  (100  sources, 
100  destinations) .  The  size  100x100  was  chosen  as  a  reasonable  extension 
over  the  medium  size  of  30x30.  The  Initial  cluster  size  was  chosen  as 
3x3  for  the  30x30  problems  and  5x5  for  the  100x100  problems.  These  sizes 
were  chosen  based  on  preliminary  results  experienced  in  the  aggregation 
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of  general  linear  programs  (see  [18]).  Finally,  all  test  problems  were 
subjected  to  both  destination  and  source  aggregation. 

Computational  Results 

The  first  set  of  problems  tested  were  those  problems  In  the  range  of 
30x30.  These  problems  were  evaluated  using  the  proposed  methodology  and 
the  computation  times  compared  with  the  time  to  solve  the  original 
problem.  The  problems  were  run  on  a  CDC  CYBER  760/140.  For  the  five 
test  problems  generated,  the  results  using  both  TPGO  and  the  proposed 
method  are  given  In  Table  1.  The  table  provides  the  error  bound  level 
and  computation  time  (cpu-seconds)  at  each  iteration  for  the  five  test 
problems.  The  average  time  to  solve  these  problems  to  optimality  by  TPGO 
was  0.796  cpu-seconds.  In  this  same  time,  the  proposed  method  produced 
aproxlmate  solutions  with  an  average  error  bound  level  of  9.02.  Although 
the  times  to  solve  the  problems  using  the  aggregate  method  are  slightly 
higher,  the  orders  of  magnitudes  of  the  solution  times  are  comparable. 

The  second  set  of  problems  tested  were  for  those  transportation 
probems  with  100  sources  and  100  destinations .  These  problems  used  an 
initial  aggregate  program  size  of  five  destinations  and  five  sources. 
These  programs  were  also  run  on  a  CDC  CYBER  760/140.  The  results  of 
these  runs  are  shown  in  Table  2.  The  solution  times  for  TPGO  are  again 
comparable  to  the  times  for  using  the  proposed  algorithm  to  an  error 
bound  level  of  10Z.  It  is  clear  that  the  algorithm  can  fully  handle 
large  type  problems,  as  the  largest  aggregate  transportation  problems 
that  was  solved  by  the  proposed  method  was  a  32  source  by  38  destintion 
transportation  problem. 
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5.  Conclusions 

The  main  objective  of  the  research  was  to  provide  a  practical  tool 
to  solve  large  transportation  problems  through  the  enhancement  of  the 
computer's  capabilities.  The  computational  results  show  that  aggregate 
programming  can  be  used  on  problems  larger  than  currently  solvable  on  a 
particular  system.  Further,  the  proposed  bounds  produce  tighter  bounds 
than  previously  available.  A  procedure  is  developed  to  reformulate  the 
aggregate  problem  and  thus  produce  tighter  bounds  at  each  subsequent 
iteration.  This  strategy  may  provide  a  more  cost  effective  approach  to 
solving  certain  transportation  problems. 


0 


0 


9 
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Table  1 


Computation  Time  for  Proposed  Algorithm  and  TPGO 


Problem  Number 


Iteration 

#1 

#2 

#3 

#4 

#5 

a 

a 

a 

a 

a 

b 

b 

b 

b 

b 

1 

.136 

.136 

.135 

.135 

.138 

20.4 

17.4 

29.2 

30.2 

34.2 

2 

.218 

.219 

.215 

.213 

.218 

17.5 

17.1 

18.1 

24.9 

20.2 

3 

.338 

.403* 

.325 

.301 

.332 

13.1 

14.2 

15.0 

16.7 

15.2 

4 

.494 

.610* 

.556* 

.537* 

.595* 

10.9 

13.0 

12.3 

9.6 

13.9 

5 

.795* 

.863* 

.840* 

.850* 

9.6 

12.2 

10.0 

7.4 

6 

1.001* 

8.3 

Time  to 


Solution 

.772 

.922 

.711 

.763 

.812 

by  TPGO 


*with  optimal  disaggregation 


a  Computation  time  (cpus) 
b  Error  bound  level  (t) 

Remarks:  All  problems  run  on  a  CDC  CYBER  760/140. 
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Table  2 


Computation  Times  for  Large  Problems  (100x100) 


Iteration 

#1 

Problem  Number 
#2 

#3 

a 

a 

a 

b 

b 

b 

1 

.234 

.221 

.242 

21.7 

54.7 

47.2 

2 

.359 

.381 

.371 

21.1 

49.3 

28.6 

3 

.505 

.537 

.512 

20.4 

37.4 

21.2 

4 

.682 

.712 

.699 

18.7 

25.2 

19.3 

5 

1.051* 

.924 

1.062* 

11.7 

18.3 

10.6 

6 

1.201* 

1.257* 

1.216* 

9.9 

8.2 

10.2 

Time  to 

Solution 

1.194 

1.186 

1.203 

by  TPGO 


*with  optimal  disaggregation 

a  Computation  time  (cpus) 
b  Error  bound  level  (%) 

Remarks:  All  problems  run  on  a  CDC  CYBER  760/140 
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